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first-order logic FO^[<,suc] over finite words with linear order and binary suc- 



cessor predicate. We give a single identity of omega-terms for each level of this 
hierarchy. This shows that it is decidable for a given regular language and a non- 
, negative integer m, whether the language is definable by a formula in FO^ [<, sue] 

^w' ' which has at most m quantifier alternations. We also consider the alternation 

hierarchy of unary temporal logic TL[X, F, Y, P] defined by the maximal number 
C/3 ■ of nested negations. This hierarchy coincides with the FO^[<,suc] alternation 

, ^ , I hierarchy. 
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o ' 

vTi ' 1 Introduction 

' Around 1960, Biichi, Elgot and Trakhtenbrot independently showed that monadic second- 

order logic (MSO) over finite words defines exactly the class of regular languages pi I32| . 
' Since then, numerous fragments of MSO have been considered. A theoretical motivation for 

fragments is the study of the rich structure within the regular languages. For this purpose, 
fragments form the basis of a descriptive complexity theory: The simpler the formula for 
defining a language, the simpler this language is. In contrast to classical complexity theory, 
in some cases one can actually check whether a given language lies in a certain descriptional 
, complexity class. From a practical point of view, simpler fragments often lead to more 

efficient algorithms for decision problems such as satisfiability. 

The most prominent fragment of MSO is first-order logic FO. The atomic predicates 
of FO are the unary predicate X{x) — a stating that position x is labeled by the letter a, 
and the binary predicates x — y and x < y with the natural interpretation. The successor 
predicate suc(2:, y) is easily definable in FO by saying that x < y and that there is no 
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position between x and y. McNaughton and Papert showed that a language is FO-definable 
if and only if it is star-free [T^. Combined with Schiitzenberger's characterization of star- 
free languages in terms of finite aperiodic monoids |20J, it follows that a language is FO- 
definable if and only if its syntactic monoid is aperiodic. The latter property is decidable and 
thus, one can effectively check whether a regular language (given e.g. by a nondeterministic 
automaton or an MSO formula) is definable in FO. The two most famous hierarchies 
within FO are the Straubing-Therien hierarchy and Brzozowski's dot-depth hierarchy. The 
Straubing-Therien hierarchy coincides with the quantifier alternation inside FO without the 
successor predicate |24l[28], and Brzozowski's dot-depth hierarchy is captured by quantifier 
alternation including the successor predicate [3]; see also [111 [301 ■ Here, quantifier alternation 
is defined in terms of blocks of quantifiers for formulae in prenex normal form. Note that 
by introducing new variables, every formula can be converted into prenex normal form. 
Deciding membership for level m of one of the hierarchies for a given regular language is one 
of the most challenging open problems in automata theory. To date, for both hierarchies 
only the very first levels (i.e., m = f) are known to be decidable [51 123j. 

By Kamp's Theorem [5], first-order logic FO^ with only three different names for the 
variables and full first-order logic FO have the same expressive power. Two variables are 
not sufficient for defining all first-order definable languages; for example {ac*hc*)* is not 
definable in first-order logic with two variables FO^, even in the presence of the successor 
predicate. The fragment FO^[<] without successor predicate has a huge number of different 
characterizations; see e.g. [SldT]- of them is the variety DA of finite monoids [2f) : 

c/. |29j . For quantifier alternation inside FO^ one cannot readily rely on prenex normal forms. 
However, in FO^ negations can be moved towards the atomic formulae, and hence every 
formula is equivalent to a negation-free counterpart. The fragment FO^j consists of all FO^- 
formulae whose negation-free counterpart has at most m blocks of quantifiers on each path of 
the parse tree. Kufleitner and Weil have shown that for every m > f it is decidable whether 
a given regular language is definable in FO^[<] without successor predicate [f5j. They have 
given an effective algebraic characterization in terms of levels of the Trotter- Weil hierarchy of 
finite monoids [33] ; see also [T?! . In addition, restrictions of many other characterizations of 
the FO^[<]-dcfinable Ian guages admit algebraic counterparts within this hierarchy |f 21 116| . 
The proof of Kufleitner and Weil's characterization of F0^[<] relies on a combinatorial 
tool known under the terms ranker [34] and turtle program |22| . A connection between 
FO^[<] and rankers was established by Weis and Immerman [33] and further exploited by 
Kufleitner and Weil [TB]. Straubing has given another algebraic characterization of FO^J<] 
in terms of weakly iterated block products of J^-trivial monoids [25]. Recently, Krebs and 
Straubing ^lOj were able to use this characterization for giving identities of omega-terms for 
FOf,J<], thereby obtaining another effective characterization of FO^J<]. 

In this paper, we consider the quantifier alternation hierarchy inside FO^[<,suc] with 
successor predicate. The logic FO^[<,suc] is strictly more expressive than FO^[<] without 
successor. For example, the language {ah)* is definable in FO^[<,suc] but not in FO^[<]. 
Therien and Wilke [29] gave an algebraic characterization of FO^[<,suc] which, by a pre- 
vious result of Almeida, is known to coincide with the decidable variety LDA of finite 
semigroups [T]; see also [3]. For every to > 2 we give a single identity of omega-terms such 
that a language is definable in FOf„[<,suc] if and only if its syntactic semigroup satisfies 
this identity. In particular, it is decidable whether a given regular language is FOf„[<, suc]- 
definable. 

Our proof is by induction on to and the base case is Knast's Theorem on dot-depth one 
For TO = I, there is a small difference between the availability and the absence of min- and 
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max-predicates; this is identical to the situation for dot-depth one [llj. The main ingredients 
of our proof are (i) string rewriting techniques, (ii) combinatorial properties of LDA, and 
(Hi) relativization techniques for FO^. As a byproduct, we show that quantifier alternation 
in FO^[<,suc] coincides with alternation in unary temporal logic TL[X, F,Y, P] where the 
latter is based on the nesting depth of negations. This last property can also be seen using 
translations between FO^ and unary temporal logic given by Etessami, Vardi, and Wilke |7j. 

2 Preliminaries 

Throughout, A denotes a finite alphabet. The set of all finite words is A* and the set of 
all finite, nonempty words is A'^ . For m = ai • • • a„ with G A, the length of u is — n 
and the k-f actor alphabet is alphj,(w) — {a^ • • • Oi+fe-i Gyl'^| l<.i<n — k + The set 
of positions of u is pos(u) = {1, . . . , n}. If / is an interval, then u[I] denotes the factor of u 
covered by the interval of positions pos(m) fl /. If / = j], then u[i\j] is an abbreviation 
for u[I]. In particular, if 1 < i < j < n, then j] — ai - ■ -Oj. If i ~ j £ pos(m), then we 
simply write u[i] to denote the j*^ letter of u. 

First-Order Logic and Unary Temporal Logic. We consider first-order logic over finite 
words with order and successor predicates. Atomic first-order formulae are T for true, _L for 
false, label predicates X{x) = a for a G A, comparisons x = y, x < y and successor suc(a;,y) 
as well as minimum min(a;) and maximum max(a;). Words form models for formulae as 
labeled, linearly ordered sets of positions and x and y are variables ranging over positions. 
Formulae can be composed by the usual Boolean connectives, i.e., if tp and -0 are first- 
order formulae, then so are their disjunction ip W 4'^ their conjunction ip A ip, and the 
negation -^ip. Moreover, formulae can be composed by existential quantification 3xip and 
universal quantification \/xip. The semantics is as usual; see e.g. |13l I31| . We use the 
notation ip{xi, . . . , Xn) to indicate that at most the variables xi, . . . ,Xn occur freely in ip. 
We write u \= <^(zi, . . . , in) for u € A* and positions ij € pos(m) if tp is true over u with Xj 
being interpreted by ij. A formula without free variables is a sentence and in this case 
we simply write u \^ tp. For any class J- of first-order formulae, J-[C] is the restriction 
to formulae which, apart from T, _L, label predicates, and equality, only use predicates in 
C C {<, sue, min, max}. 

The fragment FO^ = FO^ [<, sue, min, max] of first-order logic contains all formulae which 
use at most two different names for variables, say x and y. For FO^-formulae f{x) with free 
variable x we stipulate the convention that ip{y) is the formula obtained by interchanging x 
and y. Using De Morgan's laws and the usual dualities between existential and universal 
quantifiers, one can see that every formula in FO^ is equivalent to a formula with nega- 
tions only applied to atomic formulae. We call such formulae negation-free (since negative 
predicates could be added to an extended signature). The fragment FO^ consists of all 
formulae in FO^ with quantifier alternation depth at most m, i.e., formulae such that the 
negation-free counterpart has at most m blocks of quantifiers on every path of the parse tree. 
(Thus if we drop the two-variable restriction, every FO^-formula admits a prenex normal 
form with m blocks of quantifiers). In other words, negation- free formulae in FO^ have at 
most m — 1 alternations of nested existential and universal quantifiers. Note that FO^ is 
closed under negation. The fragment FO^ „ contains all formulae in FO^„ with quantifier 
depth at most n. 
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Unary temporal logic TL[X, F, Y, P] consists of all formulae built from T for true, ± for 
false, labels a for a £ A, compositions using Boolean connectives as in first-order logic, 
and temporal modalities X(p, F (p, Y (p, and P for e TL[X, F,Y, P]. Formulae of unary 
temporal logic are interpreted over a word together with a position. The semantics is 
declared by the following FO^-formulae in one free variable: Let a{x) = [X{x) = a) and 

(X(^)(a;) = 3y{suc{x,y) Aip{y)), (Fi^)(a;) = 3y {x < y A ip{y)) , 
{yip){x) = 3y{suc{y,x) Aip{y)), (P'^)(a;) = 3y [y < x A ip{y)) . 

Here and in the sequel, = means syntactic equality. We often use this symbol instead of 
equality in order to avoid confusion with the symbol = occurring as atomic predicate. The 
formulae for the remaining constructs are as usual. The modalities X (neXt) and F (Future) 
are called future modalities whereas the modalities Y (Yesterday) and P (Past) are called 
past modalities. In order to define u \= without a distinguished position in u. we start 
evaluation in front (position 0) for future modalities and after (position |m| + 1) the word u 
for past modalities. More formally for a word u e A*, we define u ^ a and 

'u|=X(p if and only if u |= <^(1), u\=Vip if and only if u\=F ip{l), 
u\=y if and only if m |= <^(|u|), u\=Pp) if and only if m |= Pt^dul). 

Boolean connectives and atomic formulae T and ± are defined as usual. For example, the 

formula Xa A Y6 defines the language aA*h. Let TLm[X, F, Y, P] be the fragment of unary 
temporal logic consisting of the Boolean combinations of formulae with at most to — 1 nested 
negations. Let TL^.^fX, F, Y, P] consist of all formulae in TLto[X, F, Y, P] with operator 
depth at most n, i.e., there are at most n nested temporal modalities. 

For a formula </? in first-order logic or in unary temporal logic, let L(}p) = {u € A'^ \ u\= (p} 
be the language defined by ip. 

Algebra. Throughout this paper all semigroups are nonempty. Let S" be a finite semigroup. 
An element x e 5 is idempotent if = x. The set of all idempotents of 5 is denoted 
E{S). For every finite semigroup S there exists an integer a; > 1 such that each w-power 
is idempotent in S. Green's relations are an important concept in the structure theory of 
finite semigroups: For x,y € S let x <tz y if x = y ov x € yS and symmetrically let x <c y 
\{ X = y ov X E Sy. For Q G {TZ, C\ let x Q y li both x <g y and y <g x; and let x <g y if 
X <g y but not y <g x. We also view S as an alphabet and write u S S'^ for a word with 
letters from S. For words u,v G we say that a relation uQ v "holds in S"', if the relation 
is satisfied after evaluating u and v in S. We use this frequently for equality and Green's 
relations. 

Frequently, classes of semigroups are defined by identities of omega-terms. An omega- 
term over a set of variables E is defined inductively. Every .t G S is an omega-term, and if u 
and V are omega-terms, then so are uv and u'^. A finite semigroup S satisfies the identity 
u = V if for each homomorphism h : S+ — )• 5 we have h{u) = h{v). Here, h is extended to 
omega-terms by letting h{u^) be the idempotent generated by h{u). 

Next, we define classes Wm of finite semigroups. To this end, we inductively define 
sequences of omega-terms Um, Vm with variables e, /, Xj, yt, s, t,pi, qi. For m > 2 we let 

Ui = (e"s/"a;ie")"s(/"2/ie'"'i/'"')'"', Um = iP7nUm-iqmXm)'^PmUni-iqm{ymPmUm-iqm)'^ , 
Vi = (e"s/"xie")"t(/"t/ie"i/")", Vm = {PmUm-iqmXmT PmVm-iqm{ymPmUm-iqm)" ■ 
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By definition, a semigroup is in W„i if it satisfies the identity Um — Vm- The class Wi 
is Knast's algebraic characterization of dot-depth one [S]- The only difference between Ui 
and Vi is the central variable in Ui being s and in Vi being t. Intuitively, this difference is 
more and more "shielded" in Um and Vm with increasing m. 

For every e G E{S) the set eSe forms the so-called local monoid at e. A semigroup S 
belongs to LDA if every local monoid eSe satisfies {xy)'^ x{xyY = {xy)'^ ■ This is equivalent 
to saying that we have (exeye)^ exe{exeye)^ = {exeye)^ in 5" for all x,y E S and all e G E{S). 
Note that if S is in LDA and if e G E{S) and x, y G e5e then, 

{xyr = (x2;)--ix(ya;)"y = {xyr-^x{yxry{yxry = (a;2/)2"y(xyr = (2:yr2/(:ry)". 

Thus despite its asymmetric definition, LDA is left-right-symmetric. 

A homomorphism h : — !■ S* to a finite semigroup S recognizes a language L C 
if h~^{h{L)) = L. A semigroup S recognizes L C A'^ if there exists a homomorphism 
h : —5- S which recognizes L. For u, u G let u =l w if pwg G L is equivalent to 
pvq G i for all p,q G A* . The relation = l over A'^ is a congruence and the semigroup 
A'^/=L, also denoted by Synt(L) and called the syntactic semigroup of L, is the unique 
minimal semigroup recognizing L. Moreover, it is effectively computable (e.g. from an 
automaton for L); cf. [18J. 



3 Alternation within Two- Variable First-Order Logic with 
Successor 

The following result is the main contribution of this paper. The remainder of this section is 
dedicated to its proof. 

Theorem 1 Let m > 2 and let L C A~^ . The following assertions are equivalent: 

1) L is definable in FO^[<,suc]. 

2) L is definable in TLm[X, F, Y, P]. 

3) Synt(L) G W„,. 

Before turning to the proof of Theorem [1] we record the following decidability corollary. 
For TO = 1 it relies on a characterization of two-sided ideals inside dot-depth one 

Corollary 1 For every positive integer m one can decide whether a given regular language 
L C yl+ is definable in FO^[<,suc]. 

Proof: By Theorem [U it remains to prove the case m — 1. A language is definable in 
FO^[<,suc] if and only ifit is Boolean combination of languages of the form A* wi A* • • -wtA*. 
This is known to be equivalent to Synt(L) G Wi and the image of L under the syntactic 
homomorphism being a union of so-called jT'-classes, cf. Both properties can be verified 
effectively. □ 

In Subsection 13.11 we start with the implication from ([3]) to ((TJ in Theorem [T] This is 
essentially Proposition [T] whose proof requires some preparatory work: We first show that 
every W™ is contained in LDA (Lemma[T]) which enables us to use a combinatorial property 
of LDA (given in LemmaS]). Then a relativization technique for FO^ (Lemma[5]) is used for 
defining a congruence ~m,n (Definition [T]) as a tool for FO^. The connection between this 
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congruence and FO^j is established by Lemma [71 Using a string rewriting system, a special 
factorization (given in Lemmaj^]) finally leads to an inductive scheme to prove Proposition[T] 
In Subsection l3.2l the reverse implication from ([T]) to ([3]) is given by Lemma[TT] It is shown 
that if a language is definable in FO^J<,suc], then it is recognized by a semigroup in W„i. 
Since the syntactic semigroup is a divisor of every recognizing semigroup, it follows that the 
syntactic semigroup of the language is in • Lemma [TO] is an intermediate result which 
shows how to reduce the alternation depth by encoding some information into an extended 
alphabet. 

Finally in Subsection l3.3l Proposition[2]incorporates unary temporal logic into the picture. 



3.1 From Algebra to First-Order Logic 

Lemma 1 For all m > 1 we have W„i Q LDA. 

Proof: Let 5 be a finite semigroup and let a; > 1 be an integer such that a;" is idempotent 
for all X (z S. Let x,y S and let e G 5 be idempotent. Setting ei = fi = s ~ e, 
xi = xey, yi = X, t — y we get Ui = (exeye)" in S and Vi — (exeye)'^ eye{exeye)'^ in S. 
Setting all other variables occurring in Um or in Vm to be e, we see Um = (exeye)" in S 
and Vm — (exeye)'^ eye{exeye)'^ in S. Thus if 5 G W„j and e G E{S), then 65*6 satisfies the 
identity (xy)'^ = {xyYy{xy)'^, i.e., S G LDA. □ 

The next lemma is an intermediate result for Lemma [3] and Lemma 3] both of which yield 
important combinatorial properties of semigroups in LDA. 

Lemma 2 Let S G LDA, let x,y, z e S, and let e G E{S). 

1) If xe TZ ye in S , then xe TZ xez if and only if ye TZ yez. 

2) If ex C ey in S, then ex C zex if and only if ey C zey. 

Proof: Since LDA is left-right symmetric, it suffices to show ([Ij. Suppose xe TZ xez. 
Since ye TZ xe TZ xez there exist s, t such that xe = yes and ye = xezt. We get ye = 
ye{esezte) . Pumping the factor in the parentheses and using LDA yields ye = ye{esezte)'^ = 
ye{esezteY ezte{esezteY £ yezS. □ 



Lemma 3 Let S G LDA, let u, u G , let s,t G S* , let \v\ > \S\ and let alph|5|^;^(ws) = 
alph|s|+i(wt). 

1) If uTZuv in S, then u TZ uvs in S if and only if uTZ uvt in S . 

2) If u C vu in S , then u C svu in S if and only if u C tvu in S . 

Proof: Since LDA is left-right symmetric, it suffices to show ([T]). Assume u TZ uv TZ uvs 
in S. We want to show u TZ uvt in S. This is trivial if t is the empty word. Otherwise we 
factorize vt = pwz such that \w\ < \wz\ = |5'| -I- 1 with w = we in S for some idempotent e 
of S. Note that every sequence xi, . . . , X|5| G S has a prefix which admits an idempotent 
stabilizer, i.e., there exists i G {!,..., |5|} and e G E{S) such that 

in S; see e.g. jilt Lemma 1] for a proof of this claim. Since vs and vt have the same factors 
of length l^l -|- 1, we find a factorization vs = siwzs2. Let x — usiw and y — upw. By 
induction uTZ y and thus xe = x TZ y = ye in S . Moreover, xe TZ xez and by Lemma [2] we 
see ye TZ yez in S. This implies the claim. □ 
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Choosing s to be the empty word and t = a immediately yields the following consequence. 

Lemma 4 Let S G LDA, let u,v £ 5+, let a e S and let \v\ > \S\. 

1) If u TZ uv >Ti uva in S, then alp]i^g^_^^i{v) ^ alph^g^^^^va). 

2) If u C vu >c civu in S, then alph|g|_(_j^ (u) alph|g|_|_]^(a?j). □ 

The next lemma gives the main combinatorial properties of FO^„[<, sue] for our purpose, 
namely relativizations of formulae to certain factors of deterministic factorizations. 

Lemma 5 Let ip G FO'^[<,suc] and let v,w £ A+. 

1) There exist formulae (<yj)<xm and {ip)>xw such that for all u = U1WU2 with a unique 
occurrence of the factor w in the prefix uiw we have 

u h {v)<Xw{i,j) iff ui 1= (p{ij) for all 1 < i, j < |ui|, 

" h {v)>Xwi'i,j) iff U2 h fii - \uiw\ ,j - \uiw\) for all \uiw\ <i,j < \u\. 

2) There exist formulae {^)<yv o-nd {(p)>Yv such that for all u = uivu2 with a unique 
occurrence of the factor v in the suffix VU2 we have 

u h {v)<yv{i,j) iff ui h= "pii,]) for all 1 < i,i < \ui\, 

u h {v)>yv{i,j) iff U2 \= ip{i - \uiv\,j - \uiv\) for all \uiv\ <i,j < \u\. 

3) There exists a formula {'~p)[y^w] such that for all u = U1VU2WU3 with a unique occurrence 
of the factor v in the suffix VU2WU3 and a unique occurrence of the factor w in the prefix 
U1VU2W we have 

u h {v)[v-w]^i,j) iff "2 h '^{i - \uiv\ ,j - \uiv\) for all \uiv\ < i,j < |-uit;-U2|- 
Moreover, if if £ FO^j „[<, sue], then 

1) {f)<xw e FO^„+i „+|„|[<,suc] and {lp)>xw e FO^„ „+|„| [<, sue], 

2) {v)<yv e FO,^„ „+|„|[<,suc] and {(p)>yv G FO^„+i.„+|^| [<, sue], and 

3) {'^)[vM e FO,^„+i „+jv[<>suc] for N = max{|w| , \w\}. 

Proof: The properties of the factorizations of u mean that we actually consider the first 
occurrence of the factor w (the last occurrence of the factor u, respectively). The letter X 
and Y derive from modalities of unary temporal logic where X^u is for neXt-it; which goes to 
the first occurrence of w, and Y„ is for Yesterday-w which goes to the last occurrence of v. 

"([1])": The idea for (iy9)<xto and (i^)>xu) is to restrict the interpretation of a formula 
to the part of the model which lies before (respectively after) the first occurrence of w. 
The construction is by structural induction. We first construct the formula {ip)<^\w Let 
(i^)<Xto = (/9 if is an atomic formula. Disjunction is given by V ■(/')<Xiu = {y^)<Xw V 
(V')<Xt«, conjunction by {ipA^)<xw = (<p)<x«, A {ip)<xw, and negation by {-^Lp)<xw = 
~'{v)<Xiu- For existential quantification we let 

{3xip)^Xw = 3x {{^3y < x : X{y) ^ w) A {ip) <^xw) ■ 

Here A(y) — w requires that the factor w begin at position y. More formally, let X(y) = w 
be defined as X{y) ~ w if w £ A and as X{y) = a A {3x: suc{y,x) A X{x) = q) if 
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w — aq for a Cz A and q G . Note that the formula X{y) = w is a purely existen- 
tial FO^-formula with quantifier depth \w\ — 1. Universal quantification is given dually by 
(Va; (/j)<xu) = -'{Bx -'ip)<:xw As usual, swapping the variables x and y yields the correspond- 
ing constructions for y. 

Next, we give the construction of (</9)>xu)- Let {^)>xw = ^ ii f is an atomic formula. 
Boolean combinations are similar to the above. For existential quantification let 

{3xip)>xw = 3x {{3y < x: X{y) ^ w) A {ip):^xw)- 

Here X(y) = w states that the factor w ends at position y; it is defined left-right symmetri- 
cally to X{y) = w. Again, universal quantification and the constructions for y are dual. 
"(H])": This is left-right symmetric to the above. 

Finally, we give the construction for (1^9) [^.^j. Atomic formulae and Boolean combinations 
are defined as before. For existential quantification we set 

{3xip)[^.^] = 3x{^{3y < x: X{y) ^ w) A^{3y > x: X{y) ^v) A {(p)[y.^]). 

Quantification over y and universal quantification are dual. □ 

The relativization of the previous lemma leads to the congruence in the following definition. 
This congruence is our tool for the combinatorics of FO^ in the subsequent proofs. 

Definition 1 Let u,v £ A* . For m,n > Q we let u ~rn,o v and u ~o,ri v. For n > 1 let 

u ~i,n V if u and v are contained in the same monomials 'WiA'^W2 ■ ■ ■ A'^wi with Wi £ A'^ 
and juii-'-w^l < n. For m > 2 and n > 1 let u ~m.n v z/ alphj,(u) = alphj.(w) and 
pref;.(u) = prefj.(w) and sufFfe(u) = sufffc(u) for all k < n, and all of the following hold: 

1) if u — U1WU2 and v — V1WV2 with 1 < < n such that the factor w has a unique 
occurrence in the prefixes uiw and viw, then ui ~m-i,n-\w\ '>^i U2 ~m,n-\w\ "^2, 

2) if u = U1WU2 and v = V1WV2 with 1 < \w\ < n such that the factor w has a unique 
occurrence in the suffixes WU2 and WV2, then ui ~rn,n-\w\ "^^i '^"'^ ^^2 ~m-i.n~\w\ ^2- 

3) if u = Uiwu2w'u3 and v = Viwv2w' with \ww'\ < n such that the factor w has a unique 
occurrence in the suffixes wu2w'u3 and wv2w'v3 and such that the factor w' has a unique 
occurrence in the prefixes Uiwu2w' and Viwv2w' , then U2 ~m-i.?i--|totD'| '^2- 

An elementary verification shows that ~m^n is a congruence. Since this is not used in this 
paper, we do not record this fact as lemma. 

Lemma 6 For all u,v A* and m,n > 1 we have that u ~m,n v implies u v and 

Proof: The second claim is obvious. The first claim is also clear if m = 1 or if m > 3. 
It remains to consider the case m = 2. Let u «2,n v. We shall show that u £ P implies 
V G P for all P = A'^wi ■ ■ ■ A~^wg with Wi G A'^ such that \wi ■ ■ ■ wi\ < n. This is true 
if ^ = 1 because suff|iu^|(u) = suff i^^i (w). Let u — uiWiU2 and v = V1W1V2 such that in 
both prefixes uiWi and viwi there is a unique occurrence of the factor wi, i.e., consider 
the first wi of u and of v. Note that since alph|j„^|(u) — alph|„,^|(w) such a factorization 
of V exists. By definition, U2 ~2,n-|iiii| ^^2- Moreover, U2 £ A'^W2 ■ ■ ■ A'^wi and by induction 
£ A'^W2 ■ ■ • A+Wf. Moreover, vi £ because pref|^^|(M) — pref|„^|(w) and thus u £ P. 
Now, if M £ wiA'^W2 ■ ■ ■ A'^wi with u = wiu' , then v = wiv' and by definition u' ~2,n-|toi| 
The above claim yields v' £ A'^W2 ■ ■ ■ A'^wi, i.e., v £ wiA~^W2 ■ ■ ■ A'^wi. Symmetry yields 
u »i,„ V. □ 
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The next lemma connects FO^j „ with the combinatorial properties captured by ~m,n- 
For u,v £ A* let u =i^„ w if u and v model the same formulae in FO^ „[<, sue, min, max]. 
For m > 2 and u,v £ A* let u =m,n v if u and v model the same formulae in FO^_„[<, sue]. 
For technical reasons, we have to include min and max predicates at level 1. 

Lemma 7 For all m,n >0 we have that u =m_n+i v implies u ~m,n v. 

Proof: To prove the claim, we give a shortcut which extends the label predicate. Let 
Wi G A~^ and be a FO^-formula with free variable x. The formula \{x) — {wi , . . . ,Wi,^) 
with free variable x requires that, starting from the position x, there be occurrences of the 
factors wi, . . . ,we in this order such that on the last position i of the occurrence of W£ 
the formula ip{i) hold. More precisely, for every u G j4+ and every x G pos(w) we have 
u \= (A(x) = {wi, . . . ,W£,ip)) if and only if there exist V£ G A* and Vi G 1 < i < £, 
such that u[x; \u\] = wiVi ■ ■ ■ wivi and u \= ip{i) for i — x + \wiVi ■ ■ ■ Wi\ — 1. Formally, for 
It; — aw' with w' G A* we define X{x) = (w, ^p) by 

X{x) = a A3y: (suc(x,?/) A \{y) = (w'jijj)) if |ti;'| > 0, 
X{x) = a A ijj{x) else. 

We extend this notation inductively to sequences of factors by setting A(a;) = (wi, . . . ^Wi^tp) 
to A(a;) = {wijtp') where ip'{x) = 3y > x: ^snc{x,y) A (A(y) = {w2, ■ ■ ■ ^wejipiy))) ■ As 
usual, ipiu) is obtained from iJj{x) by interchanging the variables x and y. li £ = 1 and 
= T, then we simply write A(a;) — wi. 

Suppose m = 1 and n > 1. Let P — wiA'^W2 ■ ■ ■ A'^we with Wi G ^4+ and • • • wi\ < n. 
Then the formula 

ip = 3x: mm{x) A (^X{x) — {wi, ... ,wi, max{x))^ . 

is in FO^ „[<, sue, min, max]. We have m G P if and only if u ^ if and only if v ip if 
and only if w G P. The second equivalence holds by assumption. 



Suppose m > 2, n > 1 and u 


771 


n+l V. 


Let 




3x: 


(A(x) 


-KT)), 


V2 = 


3x: 


(A(x) 


= KT)) A (Vy: X < y). 


f3 = 


3x: 


(A(x) 


^{w,yy: y < x)), 



for w G A'^ with |?x;| < n. Then by assumption u \= ipi if and only ifv\=(pi. This respectively 
yields alph^,(w) = a\phi^{v), pTefi^{u) = prefj,(?;), and sufffe(w) = sufi'fc(u) for all k < n. Note 
that <p2 and (fs may have quantifier depth n + l. It remains to show Definition [T] (HHS]) . 

Definition [T] (ID) : Let u = Uiwu2 and v = viwv2 with 1 < \w\ < n such that the factor w 
has a unique occurrence in the prefixes uiw and viw. First, we show U2 ~m,n-lu)| ^2- 
Using the assumption and Lemma [5l we see that for all (p G FO^„ [<, sue] we have 

U2 \= ip if and only if tt |= ((p)>xio if and only if w |= {p)>xw if and only if V2 \= p- 
Note that {p)>xw G FO^„ „_|_]^[<, sue]. Thus U2 =m,n-\w\+i ^2 and induction on n yields 

U2 ~m,n-\w\ ^2- 

Next, we show ui ~m-i,n-\w\ vi. Suppose m — 2 and let P ~ w\A^W2 ■ ■ ■ A'^wi with 
\wi • ■ ■ wi\ < n — \w\. Let 

^|^{x) = {3y: snc{x,y) A X{y) = w) A -^{3y < x: X{y) — w) and 
p EE 3x: (yy: X < y) A X{wi , . . . , i/")- 
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Note that Lp e FO2 „[<, sue]. By assumption we get ui G P if and only if u ^ if and only 
\i V \= ^) \i and only if v\ G P. This shows u\ ~\.rL-\w\ Suppose now m > 3. Using the 
formulae {ip)<:xw G FO^„ „_|_]^[<, sue] from LemmajSJ we see ui ^ ip if and only if u ^ {f)<Xw 
if and only if w |= {(p}<:xio if and only if ui \= tp for all ip G FO^_;^_„_|^,|_|_i[<, sue]. Thus 
Ml =m-i,n-\w\+i vi and inductively m 

~m — l,n— \w\ 

Definition [T] : This is left- right symmetric to the above. 

Definition [T] ([3]) : Let u = ui'wu2w'u^ and v = Viwv2w'v3 with \ww'\ < n such that the 
factor w has a unique occurrence in the suffixes wu2w'u3 and wv2w'v3 and such that the 
factor w' has a unique occurrence in the prefixes uiwu2w' and viwv2w' . First suppose m = 2 
and let P — wiA^W2 ■ ■ ■ A'^wi with \wi • • • w^j < n — \ww'\. Let 

Tpix) = {3y: suc{x,y) A X{y) = w) A {-By < x: \{y) = w) and 

(fi EE 3x: {3y: suc{y,x) A X{y) ^ w) A {-^3y > x: X{y) = w) A {X{x) = (wi, . . .^Wi^ip)). 

Note that G FO2 „[<,suc]. We have 1*2 G P if and only if u G (,c if and only if u G 
if and only if 112 G ^' where the second equivalence holds by assumption. This shows 
U2 ~i,n-\ww'\ ^2- For TO > 3 usiug the relativization of Lemma [S] ([3]) yields that u =rn,n+i v 
implies U2 =r,i^i,n-\ww'\+i «2- Induction thus yields U2 ~r,i-i,n-\ww'\ ^2- □ 

In other words the previous lemma shows that =m.„+i is a refinement of ~m,n- In par- 
ticular, Wm.n has finite index. The next lemma essentially is an auxiliary statement used in 
the proof of Lemma [HI It says that equivalence of u and v allows order comparison for 
special factors in the words u and v. 

Lemma 8 Let u,w G v4+ he words and suppose u = xiUi • ■ ■ XkUk = u[yi---u'^yi and 
V — xivi ■ ■ ■ XkVk ~ v'lyi ■ • ■ v'^yi with k,i > 1, u[,v[,Uk,Vk S A* and Xi,yi G are 
factorizations such that 

• XiUi ■ • ■ Xk is the shortest prefix of u contained in xiA'^X2 • ■ • A'^Xk and 
XiVi ■ ■ • Xk is the shortest prefix of v contained in xiA'^X2 ■ • ■ A~^Xk, 

• yi • ■ • u'fiji is the shortest suffix of u contained in yiA^y2 ■ ■ ■ A'^yi and 
yi ■ ■ ■ v[yi is the shortest suffix of v contained in yiA~^y2 ■ ■ ■ A^yi. 

Let An ~ \u\ — \xiUi ■ ■ ■ Uk-i\ — 1^2 • • • u\y(,\ and let = \v\ — \xivi ■ ■ ■ Vk~i\ — 1^2 • • • v[yi\. 
If u V for n — \xi ■ ■ ■ Xk \ -\- | j/i • • ■ J/^I, then the relative order of the occurrences of Xk 
and yi is the same in u and v, i.e., one of the following conditions applies: 
1) A„ > \xkyi\ and A„ > \xkyi\. 

A„ < and Ay < 0. 
3) A„ = A„. 

Proof: Suppose that A„ > jx^-yil, i.e., in u all positions of the occurrence of yi are at 
least by one greater than all positions of the occurrence of Xk- Then u is contained in the 
monomial P = cci • • • XkA~^yiA~^ ■ ■ - yi- By assumption, v is also contained in P and thus 
A„ > \xkyi\- Symmetry yields A„ > |a;fcyi| if and only if A^ > \xkyi\. 

Suppose that < A„ < |xfcyi|, i.e., in u the occurrences of Xk and yi are adjacent or 
overlap each other. By the above, we can assume A^ < |xfe?/i |. Let 1 < i < A: be minimal such 
that \xiUi ■ ■ ■ Xi\ > \u[ \ and let 1 < j < ^ be maximal such that • • • | < \xiUi ■ ■ ■ Xk\- 

Let p be the shorter of the two prefixes u'l and xiui ■ ■ ■ xi^iUi^i of u. Similarly, let q be the 
shorter of the two suffixes Uk and u'^j^^y^j^i ■ ■ ■ u'f^yi. Then z = p^^uq^^ is the factor of u 
corresponding to all positions of Xm's and j/m's which are adjacent to or overlapping with Xk 
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or 2/1. We have that u is contained in xiA^ ■ ■ ■ Xi^iA^ zA^yj-^-i ■ ■ ■ A^yi and by assumption 
so is V. Note that • • • Xi-izy.j+i ■ ■ ■ ye\ < n. Hence, relative to the occurrence of yi, the 
occurrence of Xk cannot be further to the right in v than in u, i.e., < A„ < A„ < |xfcyi|. 
By symmetry < A„ < if and only if < A^, < Ixfej/ij; and if this is the case, then 

A„ = A„. 

By the above we also get A„ < if and only if A„ < 0. This shows that at least one of 
the conditions ([Ij, (0), or ([3]) applies. □ 

The main combinatorial ingredient to go from W„j to FO^ is the factorization given in 
the following lemma. It combines our findings so far regarding LDA and ~rn.n- 

Lemma 9 Let S G LDA, let m > 2, let N ^ 2 \S\^ , and let u,v G with u ^m,n+N v. 
There exist factorizations u = wqSiWi ■ ■ ■ siwg and v — wotiWi ■ ■ ■ tfWi with Wi, Si, ti G , 
and \wq ■ ■ ■'Wi\ < N such that for all 1 < i < £ the following hold: 

Ij Si ~m— l,n ^z? 

2) wqSi ■ ■ ■ TZ wqSi ■ • ■ Wi-iSi in S, 

3) Wi ■ ■ ■ tiWi C tiWi ■ ■ ■ tiWi in S. 

Proof: Let X' = {l}U{i G N | I < i < \u\ , u[l; i - 1] >tz u[l;i] in S} be the set of positions 
of u which cause an 7?.-descent when reading from left to right. Let X be the set of positions j 
such that there exists i € X' with < i — j < \S\, i.e., we include all \S\ positions 
to the left of each i £ X' . Let Y' and Y be defined left-right symmetrically on v, i.e., 
Y' = {|t'|}U{i e N I I < i <\v\, v[i - \; \v\] >c u[i; \v\] in S} and Y is the set of positions j 
such that < j - i < l^l for some i e Y' . Let X = Xi U • • • U Xfc where ^ are 
maximal subsets of consecutive positions of X such that all positions of Xi are smaller than 
all positions of X^+i . Symmetrically, let F = U • • • U Yk' where l^i 7^ are maximal subsets 
of consecutive positions of Y and such that all positions of Yi are smaller than all positions 
of 

Let Xi — u[Xi] and yi — u[Yi\ be the factors of u and v covered by the positions of Xi 
and Yi, respectively. By construction and Lemma U] ([T]) , we see that u[l;max(Xi)] is the 
shortest prefix of u which is contained in xiS^X2 ■ ■ ■ S^Xi. Symmetrically, t;[min(Yi); \v\] is 
the shortest suffix of v which is contained in yiS^yi+i ■ ■ ■ S^yk' by Lemma |3] (O ■ We use 
these properties to transfer the positions of X to w and the positions of Y to u. More precisely, 
we let Y" — Y]"U- • -UY^!,' be such that each Y"/' is an interval of positions oiu with u[yi"] = yi 
and u[min(y/'); is the shortest sufHx of u which is contained in yiS'^yi+i ■ ■ ■ S^yk' . And 
we let X = X"U- • -UX^' be such that each X" is an interval of positions of v with w[X-'] = Xi 
and v[\; max(X")] is the shortest prefix of v which is contained in xiS^X2 ■ ■ ■ S^Xi. Note 
that u G S*yiS^y2 ■ ■ ■ S^yk' and v G xiS^X2 ■ ■ ■ S^x^S* since u ~m.n+N v. 

Now, consider the factorization u = wqSiWi ■ ■ ■ SiWi with Si G such that the Wi are 
the factors covered by maximal subsets of consecutive positions of X U Y" . Intuitively, this 
means that we merge overlapping and adjacent factors Xi and yj in u. Lemma [5] shows 
that the relative order of those concrete occurrences of Xi and yj in u and in v is the same. 
Therefore, if we consider the factorization of v which is covered by maximal subsets of 
consecutive positions in X" U Y , then we end up with the same factors in the same order, 
i.e., we have v — wotiWi ■ ■ ■ tiWg for some ti G 5"+. 

Since the 7^-class and the £-class can descend at most \S\ — 1 times, we have \X' U Y'\ < 
2 \S\ and thus |wo • • • we\ < \X U Y"\ < 2 \S\^. Moreover, by construction, every 7?.-descent 
when reading prefixes of u as well as every £-descent when reading suffixes of v is covered 
by some factor Wi showing ^ and ([3]). 
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It remains to show Si ~m-i,n U for all i. An intermediate step is the following claim. 

Claim // SkWk ■ ■ ■ SiWi «m,„+Ar tkWk ■ ■ ■ tewi for some N > \wk-- - wil, then «m_i,„ U 
for all i G {k, ...,£}. 

The proof of this claim is by induction on £ — k. Every Wi either arises from some x^'s 
or some j/j's or both. Therefore, the w^'s inherit the properties of the corresponding Xj's 
and yj 's of being the first occurrence (respectively being the last occurrence) . If there is no 
Wi arising from an Xj, then every Wi has a unique occurrence in WiSi+i as well as in Witi+i. 
Thus Si ~m-i,ji ii for all i by an [£ — /c)-fold application of condition ([2]) in the definition of 
~m.n (from right to left). For i = k this uses Lemma [S] 

Fix the first Wi which arises from an Xj . We have Sj ~m~i,n tj for all j > ihy condition ([1} 
in the definition of ~m,n and induction. If i = A:, then Sk ~m-i.n tk again by condition ([T]) 
in the definition of ~m,n- Assume therefore i > k m. the sequel. Let h > i he minimal 
such that Wh arises from some y-,-; note that arises from y^i. By a repeated application 
of condition ([2]) in the definition of ~ra,n we get that SkWk ■ • • s/i ~m.n+N' tkWk ■ ■ 'th for 
A'"' = ■ ■ -Wh-il- Now, Wi-i has a unique occurrence in each of the words Wi-iSi ■ ■ ■ Sh 
and Wi-iti ■ ■ -tfi- Therefore, by repeatedly applying condition ^ in the definition of ~m,n 
we see that sj ~,n-i,ri tj for all k < j < i. If h > i, then by condition ^ in the definition 
of ~m,n we see that Si ^m-i,n ti\ and if h = i, then this follows from condition Q in the 
definition of ~m,Ti- This concludes the proof of the claim. 

Now by condition ([T]) in the definition of ~m,m we see siWi ■ ■ ■ SiWi ~m,n+N' tiwi ■ ■ ■ tiwg 
for N' = N — \wo\ and the above claim yields Sj ^m-i.n tj for all I < j < (■ □ 

The following proposition essentially shows how to pass from W„i to FOf,J<,suc]. The 
key to its proof is a string rewriting system which enables induction on the parameter m. 
Intuitively, for S G W„i we consider the maximal quotient contained in W,„_i. Since the 
latter is given by an omega-identity, this quotient can be described by a string rewriting 
system. A single rewriting step of this system corresponds to one application of the omega- 
identity for Wm_i and can be lifted to relatively easily. 

Proposition 1 For every S G W,„ with m > 1 there exists n > 1 such that u !^rn,n v 
implies u — v in S for all u,v E . 

Proof: We perform an induction on m. By Knast's Theorem |9j, if L is recognized by 
a semigroup in 5 G Wi, then the language L is a Boolean combination of monomials 
'WiA'^W2 ■ ■ ■ A^Wi. Choosing n > 1 such that for all these monomials we have jwi • ■ • w^j < n 
yields the claim for m — 1. 

Let uj > \S\ be an integer such x'^ is idempotent in S for all x E S. Consider the 
relation — > on given by s ^ f if s = i in 5 or if s = pUm-iQ and t = pVm-iQ for some 
p,q e S* and some Xi,e,yi, f,p^,q^,z,z' G 5+ such that ui = (e"2:/"xie")"z(/'^yie"2:7")'^ 
and vi = (e"z/"a;ie")"z'(/"?/ie"z7")" and for i > 2 we have 

Ui = {piUi-iqiXiYpiUi^iq.,{yiPiUi^iqiY , = {piUi-iqiXiYpiVi^iqi{yiPiUi^iqiY . 

Let Pf be the reflexive, symmetric and transitive closure of — >. The relation -A- is a congru- 
ence of finite index (since 5*+/ A is a quotient of S). Moreover, x'^ A x^'^ for all x G 
and S+ E^M„^-l. 

Claim 1 Let u, s,t G . If s t, then u TZ us in S if and only if uTZ ut in S . 
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Assume without restriction that s ^ < in S. By construction of Um-i and Wm-i, we have 
alph|gj_|_]^(s) — alph|5|_|_]^(<:). Note that by choice of w, in particular both words have the 
same prefix and the same suffix of length \S\ + 1. Lemma [3] yields the claim. 

Claim 2 Let u, v, s,t £ with s -P^ t. If uTZ us and v C tv in S, then usv = utv in S. 

Since s A t, there exists fc > and wq, . . . ,Wk £ such that s — wq and — t and such 
that either Wi-i — )■ Wi or Wi — >■ Wi-i for each \ < i < k. Claim 1 and its left-right dual, yield 
that u TZ uwi and v C WiV in 5" for all i. It therefore suffices to show the claim for s — > The 
claim is trivial ii s — t in S. Otherwise suppose s — PmUm-iqm and t = PmVm-ilm- Since 
u 72. us in S, there exists Xm G S such that u = usxm in S. Since v C tv in S the left-right 
dual of Claim 1 implies v C sv in S. Hence, there exists y„i £ S such that v = ymsv in S. 
Now, u = u{pniU„i-iq„iXmY in S and v = {ymPmUm-iqinY v in S and with S G we 
see 

usv — ui^PjyiUjyi — iQ^Xjji^ Pm^m~-lQmiymPm^m—lQm) ^ 

= u{pmUm-iqmXrnTPmVm-iqrn{ymPmUm-iqmTv = utv in S, 

thus establishing the claim. 

Since S'^/<^ G W„i_i, by induction there exists n > 1 such that s ~m-i,n t implies 
s A t for all s,t E 5+. Let w, w G 5*+ and suppose u ^m,n+N v for N ~ 2 15*1^. Let 
u — wqSiWi ■ ■ ■ sewi and v — wotiwi ■ ■ ■ tiwi be the factorizations given by Lemma[21 in par- 
ticular Si ~m-i,n ti and wqSi ■ ■ ■ Wi-i TZ wqSi ■ • ■ Wi-iSi in S and Wi ■ ■ ■ tgwi C tiWi ■ ■ • tiwi. 
By choice of n we have Si A ti for all i and repeated application of Claim 2 yields the 
following chain of identities valid in S: 

V = wotiWit2 ■ ■ ■ ti^iWi^itewe 
= WQSiVJit2 ■ ■ ■ ti^iwi^itiwe 
= WQS1VJ1S2 ■ ■ ■ ti^iwe-iteWi 

= W0S1W1S2 
= W0S1W1S2 

This concludes the proof. 



• se-iWi-it^Wi 

■ sg^iwi^isewe = u. 

□ 



3.2 From First-Order Logic to Algebra 

Next, we shall show the reverse direction, i.e., languages definable in FO^ with successor 
are recognizable in W^. Our proof is inspired by Straubing's proof showing that FO^[<]- 
definable languages are recognized in the so-called weakly iterated two-sided semidirect 
product ((J J) J) • • • ** J where J appears n times [55]. However, to avoid technical 
notation we do not use semidirect products. 

An intermediate step towards this is the following lemma reducing the alternation depth by 
encoding some information into an extended alphabet. To formalize this, for n > 1 consider 
the length-preserving map a„ whose x*'' letter of the image of u is given by {Vn{u[l;x — 
1]), a, Vn{u[x + 1; |u|])) where a is the x^^ letter of u and where for v £ A* 

Vn{v) = {(wi, ■ . ■ ,we) \ V e wiA'^w2 ■ ■ ■ A'^we, G A* , \wi - ■ -wi] <n, 1 < ^ < n} . 
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Note that Vn{v) = {(1)} if and only if u = 1. We view an{u) as a word over the alphabet 
2^ X A X 2^ with B = {{wi,. . . ,we) \ Wi e A*, \wi ■ ■ ■ we} < n, 1 <i < n}. Note however 
that a„ is not a homomorphism. 

Lemma 10 For all ip{x) € FOi^„[<,suc] with free variable x there exists a formula (f'{x) € 
FOq o[<) sue] such that for all u G A'^ and all positions x e pos(m) we have 

u \= ip{x) if and only if q:„(m) \= ^'{x). 

For allm>2 and all ip G FO^„[<,suc] there exists a formula if' € FO^_i „[<,suc] such 
that for all u e A'^ we have 

u\= if and only if a„(?i) |= </?'. 

Proof: First, suppose that <p{x) is a purely existential formula in FO^[<,suc] with quan- 
tifier depth n. Using the standard procedure (which introduces new variables), we get an 
equivalent formula with quantifier depth n in prenex normal form, i.e., 

(p{x) = 3xi ■ ■ ■ 3Xn ^' 

where ip' = (p'{x,xi, . . . ,Xn) is quantifier- free. 

In the next step, we make expHcit the order type and the label of the variables. To do 
this concisely, we introduce the following vector notation. We denote by z = {zi,. . . , Zk) a 
vector of variables and set \z_\ = k. Let 3z be a shortcut for 3zi ■ ■ ■ 3zk, let A(z) = w he 
a shortcut which states that zi < ■ ■ ■ < Zk are consecutive positions labeled by the word 
w. The shortcut z < y requires that all positions of z be smaller than all positions of y, 
and the shortcut -'suc(2:, y) states that no position of y is the successor of any position of z. 
For a fixed ordering and labeling of the variables, the truth of all atomic formulae and thus 
also of (p' is completely determined. We now consider all possibilities to order and label the 
variables such that ip' is true. Reordering variables and congregating consecutive variables 
to vectors we end up with a finite disjunction of formulae ip{x) of the form 

3zi<x---3zk<x 3zk+i > x---3zg> x: {X{x) = a) A /\^ {Zi_-^ -C A {X{zi) = Wi)) A x 

for some Wi G A'^, some a € A and some x of the form xi A X2 where 

Xi e {suc(zj^,.t).^suc(zj^,.t)}, X2 e {suc(a;,Zfe_^i),^suc(x,Zfe_^i)} . 

The variable x separates Zj from Zj iov i <k < j. This allows to split at index i to get 
conjunction of \{x) = a, of formulae of the form 

<;(x) = 3z^<x--- 3z^ <x: < A {\{zi) = Wi)) A Xi 

and of similar formulae <;'{x) involving the variables z^ with j > k. 

Now, for every model u € A'^ every position x € pos(w) the truth of w ^ <;{x) is completely 
determined by the set T'„(u[l; .t — 1]). If for example Xi = suc(zj,, x), then u \= (;{x) if and 
only a u[l; X — 1] G A*wi ■ ■ ■ A~^Wk if and only if {(1, Wi, . . . , w^), {wi, . . . , Wfe)}n'P„(u[l; a; — 
1]) ^ 0. A symmetric argument shows that <?'(a;) is completely determined by the set 
Vn{u[x + 1; \u\]). Therefore, u \= ip{x) is completely determined by the a;*'' letter of Q!„(m), 
i.e., there exists a formula i^'ix) which is a Boolean combination of labels over the alphabet 
2^ X Ax 2^ such that u \= tp{x) if and only if a„(w) |= tp'{x). This implies the first part for 
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purely existential formulae. For a purely universal formula ^p{x), we consider its negation 
and use the equivalence -i(Va; ^) = 3x -i^ and De Morgan's laws to move the negation inwards 
so that no negation ranges over any quantifier. Let ipi^) be the resulting purely existential 
formula. Setting (p'{x) = yields the claim. The extension to Boolean combinations 

of quantifier free formulae is straightforward. This concludes the first part of the proof. 

For the second part of the claim let to > 2 and ip G FO^ sue]. Without restriction, we 
may assume that no negation ranges over any quantifier. Consider an innermost quantified 
block of if, i.e., a maximal subformula -0 of of the form 3y or of the form V?/^ where 
all quantifiers in -0 are of the same type. Note that y does not appear freely in and thus 
V' = '^{x) is a formula with one free variable x. We replace every innermost quantified block ip 
by the formula ip' of the first part of the lemma. Note that we indeed have G FO^ „[<, sue]. 
This decreases the alternation depth by one since the if) are maximal. It might happen that 
the resulting formulae contains label predicates over the alphabet A. The formulae tp' is 
obtained by replacing \{x) = a with a G Ahy the disjunction of formulae X{x) = {P,a,Q) 
ranging over all P,Q (- B. □ 

The next lemma shows that for any given language definable in FO^„[<,suc] there is a 
recognizing semigroup in W^. Let u =m,n v over A'^ if puq \= (p pvq \= ip for all p, q £ A* 
and all p e FO^„ „[<,suc]. 

Lemma 11 Let m,n > 1. For 2 < i < m define the following words over the alphabet 
^ = {xi,yi,Pi,qi,e,f,s,t I 1 < i < m}: 

Ui = piiesfxie)" s {fyietf)"qu = p^{U,-lX^)''U^-l{y^U,-l)"q^, 

Vi ^piiesfxie^tifyietfrqi, V, = p,{U,.iX,YV,.i{y,U,^iTq,. 

Then for all homomorphisms h : I]+ A'^ such that h{e) =m,n h{e^) and h[f) ^m.n ^(/^) 
we have h{Um) —m,n h{Vm). In particular we have A~^/^m,n G '^m- 

Proof: In principle, the claim is that FO^„[<,suc] with order and successor is unable to 
disprove the defining identity of W„j. For conciseness, we use a slightly modified version of 
this identity. The homomorphism h gives a valuation of the variables in S. 

Note that it suffices to show h{U„i) \^ p <^ h{Vm) ^ p all p} £ FOm ri[<jSuc] and all h. 
By incorporating p into the image of pm and q into the image of q^ , this indeed implies 
ph{U,n)q h ^ ph{V,n)q h <^ for aU p,qG A*. 

We perform an induction with to = 1 as base. Consider a sentence p) G FO^ „[<,suc]. 
Even though 95 is a sentence, we may view it as a formula 1^9(2;) in one free variable x whose 
truth does not depend on the interpretation of x. Let p' {x) e FOq q be the formula from 
Lemma [TUl Then we have h{Ui) ^ 1^9 if and only if a„(/i(J7i)) \= <<9'(1) if and only if 
o:n{h{Vi)) \= (p'{l) if and only ii h{Vi) \^ p. The second equivalence holds since h{Ui) and 
h{Vi) have the same short sequences of short factors, i.e., ■P„(/i(C/i)) — Vn{h{Vi)) . 

Let m > 2 and consider Urm Vm and a valuation h : as defined in the statement 

of the lemma. Expanding the definition yields the following structure of Um and Vm : 

Um = Pm{Um-lXmT ' ' " P2 (C/ia;2 )" Pi (cs/xi e)"s(/?/iei/)"gi (y2t^l)"'72 • • • {y-mUm-lTqra, 
Vm =Pm{Um-lXmT ' ' " P2 (t^ia;2 )" Pi (es/xi e)"t (/?/iei/)"gi (l)2UlYq2 ■ ■ ■ {ymUm-lTq-m- 

Since h{e) and h{f) are idempotent in yl+/^m_„, we may assume without restriction that 
• h{e) and h{f) have lengths at least n, and 
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• h{e) is a prefix of h(s) and of h{t) and a suffix of h{pi), of h{xi) and of h{yi), and 

• is a suffix of h{s) and of h(t) and a prefix of h(qi), of h{xi) and of h{yi). 

Let J/^j = an{h{Um)) and = Q;„(ft.(ym))- We now want to show that the factors of 
U'^ and which come from the central factors Um-i (in Um) and Vm-i (in K„) can 
be obtained as the image of Um~i and Vm-i under some valuation h' : S+ — ^ where 
C = 2^ X A X 2^. 

Let h{Um) = pq with p being a prefix of h{pm{Um-iXm)"'---p2{UiX2)"'Pi{esfxieY). 
Then /i(Kn) = P?' for some q' and we have Vn{q) ~ 'Pn{q')- Symmetrically, if h{Um) — pq 
for some sufhx q of h{{fyietf Yqi{y2UiYq2 ■ ■ ■ (ymC/m-i)"?™) , then /i(Ki) = p'q for some p' 
and Vnip) = 'Pnip')- This in particular implies that and y/„ have the same prefix p of 
length |/i(pm(C/„_ia;„)")| and the same suffix q of length |/i((?/mC/„i_i)"9„)|. Let = 
p~^U^q~^ and V^^-i = P^^Kn?^^- We define a homomorphism h' : S+ — > C+ by setting 
for € S 

/i'(z) = c-i''(f')i t/;, c-i''('')i 

ifp',?' e S* are such that [/„ =p'zq' and > |pm(C/m_ix„)"| and \q'\ > \{y,nUm-iTqm\- 
In other words, we take an arbitrary occurrence of z in the central part of Um and define the 
image h'{z) as the factor of which corresponds to this occurrence. We now show that 
this mapping is well-defined, i.e., the value h'{z) does not depend on the concrete location 
of z chosen. 

Suppose W — Um OT W — Vm and Wm — p"zq" for some p",q" with \h{p")\ > 
\Pm{Um-iXm)"'\ and |/i(9")l ^ \{ymUm-i)"'qm\ is another occurrence of z in the central 
part of Um or Vm- By the assumptions on the valuation h, both occurrences of z have the 
same surrounding of n letters under the valuation h, i.e., h{p'), h{p") have the same suffix of 
length n, and h{q'), h{q") have the same prefix of length n. Let P — A*w\A^W2 ■ ■ ■ A^Wn-A* 
with Wi G and \w\ ■ ■ ■ wi\ < n. Then h{p') G P if and only if h{pm{Um-iXm)") G P ii 
and only if h{p") £ P; this holds because h{{Um-iXm)") contains short factors sufficiently 
often. Symmetrically h{q') G P if and only if h{q") G P. Moreover, wiA'^W2 ■ ■ ■ A'^Wn = 
P n wiA* n A*Wn- Thus if h{z) = ziZ2, then the sets 'Pn{h{p')zi) and 'Pn{z2h{q')) do not 
depend on the concrete location of the variable z given by p' or q' , i.e., we have 

C-W)\ T^c'-I''(9")l ^ h'i^z). 

In other words, a„ annotates all occurrences of all variables within the central factors Um-i 
and Vm~i consistently so that U'm-i = h'{Um~i) and Vm_i — h'{Vm-i)- Moreover, we 
may assume that h'{e) and h'{f) are idempotent in C+/?^„i-i.n- (Else let cj > 1 be an 
integer such that a;" is idempotent for all a: G /=m-i,n- And, instead of h, we consider 
the valuation h which maps e to h{e'^) and / to h{f'^) and all other 2: G S to h{z). The 
construction of h' then yields h'{e) ~ ft.'(e)".) 

Induction yields U'm^i —m-i,n Kn-iJ and, because ^m~i,n is a congruence, we see 
U^ ^m~i,n Kri- Consider a formula (p G FO^„ „ and let ip' G FO^m-i.n be the formula 
from Lemma [TUl Then h{Um) H if and only if [/^ ^ ip' if and only if V^^ ^ ip' if and only 
if hiVm) h This shows h{Um) =m,n h{Vm)- □ 

3.3 Temporal Logic and Completing the Proof 

It remains to incorporate temporal logic into the picture. The following lemma is the hard 
part from FO^[<, sue] to TLm[X, F, Y, P]. The proof is done by a straight- forward syntactic 
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translation. This can be also seen using a construction due to Etessami, Vardi, and Wilke [71 
proof of Theorem 1] though their statement does not mention the alternation depth of the 
resulting temporal logic formula. 

Proposition 2 Let m > 1 and n>0. Every language definable in FO^„[<,suc] is de/in- 
a6/e m TL„,3„[X, F,Y, P]. 

Proof: The construction is by induction on the structure of the formula. For the inductive 
step, we also have to take free variables into account. Let tp{x, y) G FO^ „[<, sue]. We start 
by some normalizations on the structure of (p. We assume without restriction that on all 
paths in the parse tree of ip no two successive quantifiers bind the same variable. Therefore, 
by starting the construction with a sentence having this property, we can assume that the 
first quantifier on every path in the parse tree of p binds the variable y, i.e., when thinking 
of (/J as a subformula of some sentence, then on the path to ip the previously bound variable 
is X. Moreover, we replace all universal quantifiers Vzi/' by ^Elz^V- 

For integers i, j let ord(i, j) S {^, —1, 0, +1, ^} be their order-type defined by 

< if « < j - 1 
-1 iii=j-l 
ord(i, j) ~ \ if « = j 

+1 if z = j + 1 
> \ii>j + \. 

Let / e {<C — 1, 0, +1, ^} and a £ A. We show that there exists a formula {p)i^a{x) € 
TLm,3n[X, F, Y, P] such that for all u ^ A* and all positions i,j € pos(it) with u[j] ~ a and 
ord(z, j) = / we have u \= p{i,j) if and only if u |= {p)i^a{i)- 

The construction is by induction on the structure of the formula. Let {T)j^a{x) = T, 
{l.)i^a{x) = -L and for the other atomic formulae we set 

{X{X) = b)j^a{x) = b, 

It if a = 6, 



{x < y)i,a{^) 

{suc{x,y))i^a{x) 



1. otherwise, 

T if/G{«,-l}, 

± otherwise, 

't if/ = -l, 

± otherwise. 



The formulae (y < x)i^aix) and (svLc{y, x)) j^a{x) are defined similarly. Conjunction is given 
by {p A %lj)i^a{x) = {p)i^a{x) A {ip)i,aix) and for disjunction we define {p V '(l:)i^a{x) = 
{v)i,a{x) V {tp)i^a{x). For negation we set {-ip)i_a{x) = -i{pi) i ^a{x) ■ For existential quantifi- 
cation we let (3y: p)j^a{x) be given by 

y bA{YYP{p)<<My)'^^{^)-iAy) V Mo,fc(?/) V X((^)+i,,(2;) V XXF(^)»,fc(y)). 

beA 

In the construction of the formulae {p)j.b{y), the roles of x and y are interchanged. Note 
that due to our normalization at the beginning, we do not have to handle quantification 
over X. 
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Now, we inductively define {ip) G TL„i_„[X, F, Y, P] for sentences ip 6 FO^j „[<, sue]. 
Boolean connectives are straightforward: {ip A ip) = {(p) A {ip), {ip V ijj) = (ip) V (ijj), and 
(-199) = ^{(p). For quantification we set {3y: (p) = F{ip)i^a{y) where / G {<C, — 1, 0, +1, 
and a £ A are arbitrary. Hence if L = L{ip) for some sentence ip G FO^„ „[<, sue], then 
L = L((^)) for (^) GTL™,3n[X,F,Y,P]. ' □ 

We are now ready to prove Theorem [1] 

Proof (TheoremU^: We shall show "((1]) ^ ©" and "(H]) 
"((II) ^ (0)": This follows by Proposition [H 

"([2|) (HI": This is trivial since the semantics of temporal logic formulae is given by 
two-variable first-order expressions. 

"(HI) ^ (IS|)": By Lemma [TT] the semigroup S = (A+/=m^„) is in W^. Every language 
L C j4+ definable in FO^„[<,suc] is a union of =m^„-classes and hence recognized by S. 
The syntactic semigroup of L is a quotient of S and thus is also in W^. 

"(I3|) (HI)": Suppose the homomorphism h : S with S G W^, recognizes L C A+. 

Combining Proposition [T] and Lemma [71 we see that there exists an integer n > 1 such that 
u =m,n V for u,v € implies u = v in S. Now, if u =m,n v for u,v £ A+, then h{u) = h{v). 
Thus, by specifying the =m,ra-classes of which are contained in L, we obtain a formula 
iy9 G FO^„[<,suc] such that L{ip) = h^^{h{L)) = L. Note that the syntactic semigroup 
of L recognizes L. □ 



Conclusion 

We showed that quantifier alternation for the logic FO^[<,suc] is decidable by giving a 
single identity of omega-terms for each level FO^[<,suc]. The key ingredient in our proof 
is a rewriting technique which allows us to apply induction on m. 

There is an algebraic construction V 1— >■ V * D in terms of wreath products, see e.g. |25| . 
For most logical fragments T, whenever J" corresponds to a variety of finite monoids V, 
then the fragment obtained from by adding successor predicates corresponds to the 
semigroup variety V * D. This is also the case for FO^[<] and FO^J<,suc]. Therefore, 
if Vm is the variety of finite monoids corresponding to FO^[<], then our result implies 
* D = W^. 

In general, decidability of V is not preserved by the operation V 1— >■ V * D, but a particu- 
larly nice situation occurs if V * D = LV. Here, a semigroup S is in LV if all local monoids 
of 5 are in V. For example the variety DA satisfies DA * D = LDA, see [2 |3|. For Wi 
however, Knast has given an example showing Vi * D 7^ LVi. In view of this example, we 
conjecture that * D 7^ LV„i for all m > 1. 

Acknowledgments. We would like to thank the anonymous referees of the conference ver- 
sion of this paper for many useful suggestions which helped to improve the presentation of 
the paper. 
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